Convex Optimization for Parameter Synthesis in MDPs

نویسندگان

چکیده

Probabilistic model-checking aims to prove whether a Markov decision process (MDP) satisfies temporal logic specification. The underlying methods rely on an often unrealistic assumption that the MDP is precisely known. Consequently, parametric MDPs (pMDPs) extend with transition probabilities are functions over unspecified parameters. parameter synthesis problem compute instantiation of these parameters such resulting We formulate as quadratically constrained quadratic program, which nonconvex and NP-hard solve in general. develop two approaches iteratively obtain locally optimal solutions. first approach exploits so-called convex–concave procedure (CCP), second utilizes sequential convex programming (SCP) method. techniques improve runtime scalability by multiple orders magnitude compared black-box CCP SCP merging ideas from optimization probabilistic model-checking. demonstrate satellite collision avoidance hundreds thousands states tens their wide range commonly used benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Algorithms for CVaR Optimization in MDPs

In many sequential decision-making problems we may want to manage risk by minimizing some measure of variability in costs in addition to minimizing a standard criterion. Conditional value-at-risk (CVaR) is a relatively new risk measure that addresses some of the shortcomings of the well-known variance-related risk measures, and because of its computational efficiencies has gained popularity in ...

متن کامل

Antenna array pattern synthesis via convex optimization

We show that a variety of antenna array pattern synthesis problems can be expressed as convex optimization problems, which can be (numerically) solved with great eeciency by recently developed interior-point methods. The synthesis problems involve arrays with arbitrary geometry and element directivity, constraints on far and near eld patterns over narrow or broad frequency bandwidth, and some i...

متن کامل

Sequential Convex Programming for the Efficient Verification of Parametric MDPs

Multi-objective verification problems of parametric Markov decision processes under optimality criteria can be naturally expressed as nonlinear programs. We observe that many of these computationally demanding problems belong to the subclass of signomial programs. This insight allows for a sequential optimization algorithm to efficiently compute sound but possibly suboptimal solutions. Each sta...

متن کامل

Budget Optimization for Sponsored Search: Censored Learning in MDPs

We consider the budget optimization problem faced by an advertiser participating in repeated sponsored search auctions, seeking to maximize the number of clicks attained under that budget. We cast the budget optimization problem as a Markov Decision Process (MDP) with censored observations, and propose a learning algorithm based on the wellknown Kaplan-Meier or product-limit estimator. We valid...

متن کامل

Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter

Given a nonconvex function f(x) that is an average of n smooth functions, we design stochastic first-order methods to find its approximate stationary points. The performance of our new methods depend on the smallest (negative) eigenvalue −σ of the Hessian. This parameter σ captures how strongly nonconvex f(x) is, and is analogous to the strong convexity parameter for convex optimization. At lea...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Automatic Control

سال: 2022

ISSN: ['0018-9286', '1558-2523', '2334-3303']

DOI: https://doi.org/10.1109/tac.2021.3133265